feat(config): index extra source paths from config.json (+ pending core/TTS work)#37
Merged
Conversation
…scan - Broaden built-in redaction: OpenAI (sk-), AWS (AKIA/ASIA), Google (AIza), Slack (xox), SendGrid, and URL-embedded credentials; case-fold IBANs. - Apply the secret-like file skip list to the research code scan so the MCP research tool cannot surface lines from .env-family, key, or credential files. - Widen the secret-file skip list (.env.*, id_rsa family, credentials, extra crypto extensions) behind a shared isSensitiveFilePath helper. - Add a shared text normalizer (text.ts) used by the scan.
…ead code - Reuse the shared normalizeForMatch/tokenize (text.ts) and add isRecord (guards.ts), replacing duplicated copies in query, embeddings, ingest, config, semantic-config, and store. - Centralize SOURCES_FILE_HEADER, the MIMIR_PROJECT_ROOT env name, the fast-glob ignore list, OCR image extensions, and the agent-kit manifest so doctor, skill, files, parsing, and init share one source of truth. - Export mcp searchOptions/projectRelativeGoldenPath for testing; remove the unused KbCommand type alias.
Add behaviour tests for the built-in redaction patterns and disabled passthrough, the MCP topK clamp and goldenPath traversal guard, config validation and the mcpMaxTopK env override, the security-audit redaction/gitignore warnings, the broadened secret-file skips, the research code-scan secret exclusion, and the agent-kit install/doctor contract.
Add a --lang (en|es|fr, default fr) option to `mimir audio` and `mimir-tts render`
that selects a self-contained per-language MMS model (Xenova/mms-tts-{eng,spa,fra})
for the offline Transformers.js path and a native Microsoft neural voice for the
Edge path. Behaviour is unchanged when --lang is omitted. The supported languages
live in a single TTS_LANGUAGES source with an isTtsLanguage guard reused by both
CLIs, and the language is reported in render results, doctor output, and the
audio-summary skill.
Warn contributors that scripts/public-surface-smoke.mjs scans every tracked file (tests included) and to build secret-shaped fixtures at runtime.
Add a `sources` array to .mimir/config.json so projects can declare extra file, directory, and glob paths (with `!` exclusions) without a separate sources.txt. Entries are classified like sources.txt lines and merged with it, so existing projects keep working; `mimir init` no longer writes a sources.txt for new projects. Document every config field and the `sources` syntax in the README, update the doctor hint, AGENTS.md, and the CLI reference. Cover config.sources ingestion, the legacy-file merge, and config parsing in tests. Rebuild the committed core dist.
|
🎉 This PR is included in version 1.3.0 🎉 The release is available on:
Your semantic-release bot 📦🚀 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Integrates the pending
developwork and adds the new inlinesourcesconfig field. Targetsdevelop; a follow-updevelop -> mainPR cuts the release.New — inline
sourcesin.mimir/config.jsonsources: string[]accepts file, directory, and glob paths plus!exclusions, classified likesources.txtlines and merged with it.mimir initno longer writes asources.txt; new projects get a self-containedconfig.json. The legacy file is still read when present andmimir sources add/listkeep working.sourcessyntax; doctor hint, AGENTS.md, and the CLI reference updated.Also included (already on
developlocally)Validation
distregenerated for both packages.pnpm validate) runs in CI before any publish.Release impact
featcommits bump the minor version → 1.3.0 when this reachesmain.